Shrimp genome sequence contains independent clusters of ancient and current Endogenous Viral Elements (EVE) of the parvovirus IHHNV

Background Shrimp have the ability to accommodate viruses in long term, persistent infections without signs of disease. Endogenous viral elements (EVE) play a role in this process probably via production of negative-sense Piwi-interacting RNA (piRNA)-like fragments. These bind with Piwi proteins to dampen viral replication via the RNA interference (RNAi) pathway. We searched a genome sequence (GenBank record JABERT000000000) of the giant tiger shrimp (Penaeus monodon for the presence of EVE related to a shrimp parvovirus originally named infectious hypodermal and hematopoietic necrosis virus (IHHNV). Results The shrimp genome sequence contained three piRNA-like gene clusters containing scrambled IHHNV EVE. Two clusters were located distant from one another in pseudochromosome 35 (PC35). Both PC35 clusters contained multiple sequences with high homology (99%) to GenBank records DQ228358 and EU675312 that were both called “non-infectious IHHNV Type A” (IHHNV-A) when originally discovered. However, our results and those from a recent Australian P. monodon genome assembly indicate that the relevant GenBank records for IHHNV-A are sequence-assembly artifacts derived from scrambled and fragmental IHHNV-EVE. Although the EVE in the two PC35 clusters showed high homology only to IHHNV-A, the clusters were separate and distinct with respect to the arrangement (i.e., order and reading direction) and proportional content of the IHHNV-A GenBank records. We conjecture that these 2 clusters may constitute independent allele-like clusters on a pair of homologous chromosomes. The third EVE cluster was found in pseudochromosome 7 (PC7). It contained EVE with high homology (99%) only to GenBank record AF218266 with the potential to protect shrimp against current types of infectious IHHNV. One disadvantage was that some EVE in PC7 can give false positive PCR test results for infectious IHHNV. Conclusions Our results suggested the possibility of viral-type specificity in EVE clusters. Specificity is important because whole EVE clusters for one viral type would be transmitted to offspring as collective hereditary units. This would be advantageous if one or more of the EVE within the cluster were protective against the disease caused by the cognate virus. It would also facilitate gene editing for removal of non-protective EVE clusters or for transfer of protective EVE clusters to genetically improve existing shrimp breeding stocks that might lack them. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08802-3.

Background Non-retroviral viral gene sequences inserted into host genomes during the course of evolution began to be called endogenous viral elements (EVE) around 2012 upon their discovery in vertebrate genomes [1]. However, evidence for EVE in animals was first reported for insects in 1999 [2], but the significance of the paper was not fully appreciated at the time of its publication. For shrimp, an EVE was described much later in 2007 [3] and was named genomic "non-infectious IHHNV" also before the term EVE was proposed.
In 2009, it was hypothesized that EVE (at the time called non-retroviral viral genome inserts) resulted from a mechanism that provided for heritable immunity in shrimp and insects [4,5]. In brief, the mechanism was proposed to involve host recognition of invading viral mRNA followed by its use as a substrate for host reverse transcriptase (RT) to produce variable fragments of viral copy DNA (vcDNA). The vcDNA fragments would subsequently be inserted into the shrimp genome where they would produce antisense RNA that could induce an RNA interference (RNAi) response. This would lead to inhibition of viral replication and allow the host to accommodate one or more viruses (i.e., allow viral accommodation) in persistent infections without signs of disease. Tests for predictions of the viral accommodation hypothesis proceeded much faster with insects than shrimp. By 2020, the basic predictions regarding EVE were proven for mosquitoes [6][7][8]. The studies in both Drosophila and mosquitoes revealed additional detailed mechanisms for an unpredicted, specific adaptive response involving the ability of vcDNA to produce siRNA resulting in an immediate, adaptive cellular and systemic antiviral RNAi response [7,9,10]. For details see a recent review [5].
One of the insect publications [10] revealed that the vcDNA produced by host RT in response to RNA virus infection came in two forms, one linear (lvcDNA) and one circular (cvcDNA). The latter could be specifically isolated relatively easily and was shown, when injected, to protect insect hosts againt the homologous virus. Following the protocols described for cvcDNA isolation in insects, it was shown [11] that IHHNV-cvcDNA could be extracted from IHHNV-infected P. monodon and that it could inhibit IHHNV replication in whiteleg shrimp P. vannamei challenged with IHHNV prepared from the infected P. monodon. During sequencing of the cvcDNA extract, it was also found that some of the cvcDNA constructs had high homology (98-99%) to the GenBank records DQ228358 and EU675312 known to arise from the host P. monodon genome (i.e., from EVE) originally called "non-infectious IHHNV Type-A" (IHHNV-A) [3,12]. Thus, the 2021 report [11] revealed that EVE can also give rise to cvcDNA, and it was hypothesized that the cvcDNA arose via EVE-produced RNA interacting with host.
During the time the study with IHHNV cvcDNA [11], a draft whole genome sequence (WGS) of a Thai P. monodon specimen was published [13], and we became interested to determine whether the WGS might contain EVE related to IHHNV. We considered this possible because the specimes we used for our cvcDNA work and the specimen used for the WGS project originated from the same domesticated shrimp breeding stock.

Clusters of IHHNV-EVE were found in pseudochromosomes 35 and 7
A general BlastN search of the Thai P. monodon WGS project (GenBank accession no. JABERT000000000) [13] using the queries GenBank record DQ228358 for noninfectious IHHNV-A and GenBank record AF218266 for infectious IHHNV confirmed the presence of three clusters of EVE derived from IHHNV (Fig. 1). Two of these IHHNV clusters ( Fig. 1A and B) were located would be advantageous if one or more of the EVE within the cluster were protective against the disease caused by the cognate virus. It would also facilitate gene editing for removal of non-protective EVE clusters or for transfer of protective EVE clusters to genetically improve existing shrimp breeding stocks that might lack them. Keywords: Viral accommodation, Endogenous viral element(s) (EVE), Infectious hypodermal and hematopoietic necrosis virus (IHHNV), Penaeus monodon whole genome sequence (See figure on next page.) Fig. 1 Schematic diagrams of IHHNV-EVE clusters in the draft Penaeus monodon WGS. A and B Sequence diagrams for Cluster 1 and 2, respectively, in PC35 with high homology to the non-infectious IHHNV-A query portion (1-3025 bp) of the GenBank record DQ228358. Some of the EVE sequences in the 2 clusters correspond to the same region of DQ228358 (i.e., the same color) but may differ in length and in reading direction indicated by arrowheads. Others are unique to each cluster. A zoom-in expansion of each EVE cluster is shown beneath. Colored numbers below indicate the nucleotide positions corresponding to GenBank record DQ228358. The portion of the record related to the host transposable-element portion of DQ228358 is indicated by a dark brown arrow. C Diagram of the IHHNV-EVE cluster in PC7 (GenBank accession no. JABERT010000007.1) with EVE showing high sequence identity (99%) to GenBank record AF218266 for infectious IHHNV. The numbers below the arrows represent the matching positions AF218266 Taengchaiyaphum et al. BMC Genomics (2022)  in pseudochromosome 35 (PC35) and showed high sequence homology (98-99%) to IHHNV in GenBank accession numbers DQ228358 and EU675312. The other cluster (Fig. 1C) was located in pseudochromosome 7 (PC7) and showed high sequence homology (98-99%) to GenBank accession number AF218266 for an extant type of infectious IHHNV. All three EVE clusters were clearly demarked by bracketing, direct host repeat sequences marked by red arrows in Fig. 1.

Characteristics of EVE homologous to non-infectious IHHNV-A in PC35
The GenBank records for DQ228358 (in P. monodon from East Africa) and EU675312 (in P. monodon from Australia) were initially referred to as "non-infectious IHHNV Type A" [3,12]. The IHHNV-related portions of the records for DQ228358 and EU675312 share 99% sequence identity, indicating that they arose from a similar, ancient type(s) of IHHNV that is distantly related to the types of IHHNV currently reported to cause disease in cultivated shrimp [14]. We now refer to the two GenBank records DQ228358 and EU675312 as EVE of non-infectious IHHNV-A. Here we use the DQ228358 sequence for comparative analysis but focus primarily on the portion of the sequence (1 to 3025 bp) that has homology to IHHNV. The remainder of the record (3026-4655 bp) constitutes shrimp host repeat and transposable element sequences. We suspected that EVE of IHHNV-A might be present in the Thai P. monodon WGS data. This was because we had already reported the occurrence of a variety of cvcDNA sequences with high sequence identity (98-99%) to non-infectious IHHNV-A in P. monodon specimens obtained from a different generation of the same breeding stock from which the Thai genome specimen was obtained [11]. Those cvcDNA sequences covered 68% (779-3930 nt) of the matching portion of the DQ228358 record (including some transposable element portion) and could be assembled into a single linear construct. However, at the time of that publication, we did not know the actual composition of the genome region(s) from which the cvcDNA sequences arose. All we could conclude at the time was that a variety of IHHNV-cvcDNA types were produced, some linked with host transposable element sequences and some not.
Our search of the Thai WGS revealed that PC35 did not contain a single continuous sequence with high sequence identity to IHHNV-A. Instead, it contained many high-identity EVE fragments of variable length that were scrambled with respect to proportion, position and reading direction in the non-infectious IHHNV-A reference sequences. These scrambled EVE were arranged in PC35 in two distinct clusters separated from one another by more than 80,000 bp ( Fig. 1A and B).
The individual EVE are shown in different colors to allow easy visual comparison of the EVE to matching regions of the IHHNV portion of the DQ228358 reference sequence. The positions of the EVE in PC35 are shown below each cluster diagram in black typeface while the positions relative to the DQ228358 record are shown above in colors to match those of each EVE. The colored vertical lines indicate the boundaries of the individual EVE. The number and lengths of the individual EVE and the gaps between them in the two clusters in PC35 are shown in Table 1. The EVE in the two PC35 clusters contain some common DQ228358 coverage, but they are distinct from one another in terms of the EVE types, lengths and reading directions when compared to the DQ228358 reference sequence ( Fig. 1A and B). The lengths of the EVE clusters bracketed by their respective repeat sequences were 6744 bp for Cluster 1 is and 14,993 bp for Cluster 2.
The unexpanded diagrams for Clusters 1 and 2 in Fig. 1A and B shared some structural features. For example, both of the two EVE clusters ( Fig. 1A and B) were bracketed by a pair of host, direct-repeat sequences (red arrows) of 2 × 561 bp in Cluster 1 (Fig. 1A) and 2 × 519 bp in Cluster 2 (Fig. 1B). In Cluster 1, the two repeats shared 98% sequence identity (547/561 bp) (Supplementary Fig. S1) while those in Cluster 2 shared only 77% sequence identity (400/519 bp) ( Supplementary Fig. S2). These bracketing direct repeat sequences (red arrows) demarked the boundaries of the non-infectious IHHNV-A EVE clusters. They matched many other regions in several pseudochromosomes of the Thai WGS database but at relatively low sequence identities (around 80% or less).
Within the boundaries of Cluster 1 there was a host retrotransposon element (dark brown arrow) of 561 bp ( Fig. 1A) that had 98% sequence identity to a matching portion of the 1395 bp host retrotransposon sequence that is part of GenBank record DQ228358. However, there was also a hit for 1354/1395 bases (98% identity) for the same retrotransposon portion of the DQ228358 sequence beginning very distantly from Cluster 1 at position 23,321,662 in PC35. All other spaces that separated the EVE fragments in Cluster 1 showed homology to various other shrimp host retrotransposon elements or host repeat sequences (not shown).
In contrast to Cluster 1, the pattern for Cluster 2 in PC35 (Fig. 1B) did not include any retrotransposon sequence with high homology to that in GenBank record DQ228358 within the bracketing 519 bp direct repeat sequences (red arrows). Like Cluster 1, all sequences separating the EVE showed homology to a variety of other shrimp host retrotransposon and repeat sequences (not shown). The bracketing sequences for EVE Clusters 1 and 2, matched many other regions in several pseudochromosomes of the Thai WGS database but at lower sequence identities of around 80% or less (not shown).
It is not known whether these two clusters in PC35 are linked on one chromosome of a diploid pair or whether they are matching "allele-like clusters" ("cluster-alleles") located one each in the chromosome pair represented by PC35. Now that we have the sequences of the two clusters in PC35, it will be possible to design specific primers to screen for each type in the current generation of the original P. monodon stock. By identifying one specimen positive for both clusters and mating it with one negative for both, obtaining about half the offspring each carrying only one or the other cluster would confirm whether or not they are cluster-alleles. Finally, the forms of these two clusters resemble those described from mosquitoes as piRNA-like gene clusters containing fragmented sequences from mosquito RNA viruses [6,7].
A recent publication [15] using DNA extracted from an Australian P. monodon specimen for genome analysis revealed one EVE cluster in their Scaffold group 97 (SG97) with high identity to GenBank EU675312 (i.e., the Australian version of IHHNV-A with 99% sequence identity to DQ228358). Given the information from our earlier publication and from the work reported herein and from the Australian publication, it is likely that the continuous sequence records for DQ228358 and EU675312 at GenBank are assembly artifacts, and that they were obtained from fragmented and scrambled target sequences that had sufficient overlap to result in their assembly into single linear sequences when using the sequence of infectious IHHNV (GenBank record AF218266) as a reference [3,12]. Huerlimann et al. [15] give a detailed analyses of the EVE cluster found on SG97 of their Australian P. monodon specimen plus a detailed comparison of it to the two EVE clusters that they (like us) discovered in PC35 of the Thai P. monodon WGS. Thus, those interested the detailed comparison should consult the Australian publication.

The IHHNV-EVE in PC7 showed high sequence homology to infectious IHHNV
Of greatest interest to us was the EVE cluster located in pseudochromosome 7 (PC7) (Fig. 1C) that showed high sequence identity (95-99%) only to GenBank accession number AF218266 from an extant form of infectious IHHNV [16]. In other words, this cluster contained no EVE with high sequence identity to GenBank records DQ228358 and EU675312 (i.e., non-infectious IHHNV Type A) that were found in EVE Clusters 1 and 2 in PC35. However, the general overall pattern for EVE Cluster 3 was similar to those of the two for IHHNV-A EVE clusters located on PC35. For example, the PC7 EVE were scrambled in terms of fragment location, portion and reading direction with respect to the AF218266 reference genome. They were also bracketed by two host directrepeat sequences (red arrows) of 1758 bp (98% sequence identity, Supplementary Fig. S3). The EVE were either contiguous or separated by host retrotransposon or repeat sequences. Again, the overall arrangement resembled the Pi-RNA-like gene clusters reported for EVE of RNA viruses in mosquitoes [6,7].
Our work with IHHNV-cvcDNA [11] and the work with mosquitoes [7,8] suggest that negative sense RNA transcripts of the IHHNV-EVE in this cluster would have the potential capability of inducing a host RNAi response against infection by homologous types of infectious IHHNV. If our conjecture above that the 2 IHHNV-EVE in PC35 are alleles turns out to be correct, the fact that there is only one IHHNV-EVE cluster in PC7 suggests that it might be a single cluster/allele with no matching IHHNV-EVE on the homologous chromosome. If this is so, it would provide an opportunity to prove whether or not the IHHNV-EVE in PC7 is protective against infectious IHHNV.
For example, primers could be designed to screen for the IHHNV-EVE cluster in PC7 in the current generation of the P. monodon stock. Several cluster-positive individuals could be mated with several cluster-negative individuals. If the cluster behaves in an allele-like manner, subsequent screening of the offspring from these crosses would reveal that at least one of the positive parents carried a single copy of the PC7 cluster/allele and would yield half the offspring positive for the allele and half negative for it. The offspring could then be challenged with infectious IHHNV followed by qPCR to determine infectious IHHNV loads in shrimp with and without the EVE cluster. A significantly lower mean IHHNV load in the cluster-positive offspring compared to the cluster-negative offspring would indicate that the cluster was protective. If so, the stock owner would then be able to use PCR to select crosses to maintain the protective cluster in subsequent stock generations.

Detailed analysis of the IHHNV-EVE in PC7
Altogether, the IHHNV-EVE cluster in PC7 (Fig. 1C) is 7267 bp in length and contains six EVE having high sequence identity with a current type of infectious IHHNV (GenBank record AF218266. The EVE range from 80 to 3326 bp in length (Table 1) and they are scrambled with respect to portion, position and reading direction when compared to the IHHNV reference genome AF218266. In comparison to the two EVE clusters for non-infectious IHHNV in PC35, three of the EVE in Cluster 3 are much longer and the gaps between the EVE are shorter or do not exist. The significance of these differences is currently unknown. However, the general pattern of EVE scrambling, bracketing by host repeat sequences and separation or not by host transposable element or repeat sequences is similar to that in the other two EVE clusters in PC35.
To eliminate the possibility that the IHHNV-EVE cluster in PC7 with high identity to GenBank record AF218266 was the result of IHHNV genomic DNA contamination, we obtained an archived DNA sample from or the Thai shrimp genome project and used PCR primers 98F/3762R designed to amplify approximately 94% the whole IHHNV genome sequence [11]. This failed to give the expected amplicon, indicating the absence of contaminating, full genomic DNA of IHHNV in the Thai genome project DNA (Fig. 2).

Potential for false positive PCR detection of infectious IHHNV
A BlastN analysis of the IHHNV-EVE cluster in PC7 was carried out against the target sequences for the PCR detection method ("309 method") recommended for specific detection of infectious IHHNV in the OIE diagnostic manual [17]. The 309 method was designed to be specific for detection of infectious IHHNV [12]. Our analysis of the EVE cluster in PC7 revealed that 3 of the EVE (2, 3 and 5) contained sequences with 95% identity and 100% coverage for the target of the 309 method based on the AF218266 reference sequence (Table 2). However, despite that less than 100% overall sequence identities of the potential EVE targets, the primer sequences for the 309-method matched 100% with the relevant target sequences in all 3 of the potential EVE targets (Supplementary Fig. S4). Thus, the potential amplicons would be equal length (indistinguishable) from those that would arise from infectious IHHNV. Thus, the result for the specimen used for the genome project would constitute false positive test results for infectious IHHNV.
To test for this possibility of false positive test results, a sample of the same archived genomic DNA from the Thai P. monodon genome project (above) was subjected to PCR testing using the 309F/R primers recommended by OIE for detection of infectious IHHNV. This test gave an amplicon indistinguishable from that of the positive control containing DNA of infectious IHHNV and constituted a false positive test result for the presence of infectious IHHNV (Fig. 3).

Confirmation of the IHHNV-EVE cluster in PC7 by PCR and amplicon sequencing
We further confirmed that the IHHNV-EVE cluster in PC7 was real and not the result of assembly errors that somehow included contaminating fragments of the   Fig. 1). Both gave amplicons of the predicted size based on the WGS of the IHHNV-EVE cluster (Fig. 4). Furthermore, sequencing of the amplicon from R1 revealed 99% identity to the matching region of the whole shrimp genome record (Supplementary Fig. S5). From the result, the unnatural, disjunctive IHHNV sequence in the amplicon could not have arisen from contaminating IHHNV DNA and must have originated from its target sequence in the P. monodon DNA template.

Trade consequences from shrimp stocks carrying the PC7 IHHNV-EVE
In addition to the recommended method in the OIE manual, there are two published methods proposed to aid in the detection of infectious IHHNV while avoiding false positives arising from IHHNV-EVE. One of these [18] is an isothermal PCR detection method focused on a part of the infectious IHHNV genome that appeared to be of low prevalence or absent from IHHNV-EVE revealed by screening P. monodon from Australia. Our analysis of the target sequence for the primers from this method in the PC7 EVE for infections IHHNV (Table 2) revealed only one target sequence in the cluster in which there was a single base mismatch for each primer. There was a low calculated effect on the binding efficiency of the primers, so it is hard to predict whether or not the method would give a false positive test for infectious IHHNV with the potential PC7 target. In contrast, a more recent qPCR method designed to avoid false positive test results for infectious IHHNV that arise from IHHNV-EVE [19] would not be effective with the PC7 EVE cluster because there are three target sequences, each with 100% match to the primer and probe sequences.
In the past, without knowledge of EVE, shrimp specimens giving positive PCR test results with the methods above would have been considered IHHNV-infected and the positive shrimp specimens would have been discarded during the screening process to develop specific pathogen free (SPF) breeding stocks. In this way, it is possible that potentially protective EVE were discarded during stock development. This problem of false positive test results for infectious IHHNV arising from EVE has been raised previously [14,20].
In addition, it is likely that shrimp products carrying the PC7 cluster would give false positive test results for the presence of infectious IHHNV, and this might result in the rejection of exported shrimp broodstock, shrimp PL and frozen shrimp products by countries that use the target of the OIE-recommended PCR method to screen imports for the presence of infectious IHHNV. It might be argued that use of a PCR method to detect such EVE followed by breeding selection to eliminate them from breeding stocks would be the simplest way to avoid this problem. However, such an approach might also remove naturally evolved, heritable resistance to IHHNV infection and lead to production problems for shrimp farmers.
In addition to the above potential for false positive results, another possibility has previously been raised [21]. This might occur in an SPF stock that had been developed by use of an internationally standardized PCR method to confirm absence of a particular virus based on a mutually agreed, small fragment of the target virus genome. This process would result not only in the discard of infected shrimp but also shrimp that carry EVE containing the PCR target sequence. Since some EVE may be protective and others not, it would be wise to choose the target region of any standard method to be in a genome sequence known to have no or low potential as a protective EVE.
However, this may not solve all the problems because individual shrimp in the stock population would carry Fig. 3 Agarose gel showing PCR amplicons obtained using archived P. monodon DNA from the Thai genome project as the template with the 309F/R primers recommended by OIE for detection of infectious IHHNV. Pm = P. monodon DNA from the Thai genome project as a template; P = IHHNV genomic DNA as a positive control. The arrow indicates the 309 bp-PCR amplicons from Pm and P a variety of difference EVE cluster/alleles for a particular virus. Thus, it is possible that 2 individuals in a stock population might each carry one of 2 EVE in different cluster/alleles but in the same reading direction and have the potential to re-establish the PCR detection target by crossover. Specifically, the separated EVE fragments in the two shrimp would have to have sequence overlap but with the EVE in one individual containing the forward 5′ primer target sequence for the chosen standard method but lacking the 3′ primer sequence, while the other individual would contain the 3′ primer sequence but lack the 5′ sequence. These EVE would escape the screening process to remove stock individuals with EVE that carry the target for the standard PCR detection method chosen. However, at some unpredictable time in many crosses and generations, these to sequences might end up as cluster/alleles in a single individual by random assortment of chromosomes from its parents. If so, crossover events might occur in the overlap region to re-establish the PCR target sequence and give rise to a small portion of PCR positive individuals (i.e., "pop-up" positive individuals) in the offspring of such individuals. This could happen even in a population of shrimp with a good history of freedom from the target virus [20]. For these reasons, it is essential that development of standard PCR detection methods involve a process of investigation and consultation among regulatory agencies and the companies or agencies that develop and maintain SPF shrimp breeding stocks for shrimp farmers.

The evolutionary advantage of jumbled contents of EVE clusters
We believe that the characteristic scrambling of EVE in piRNA-like clusters when compared to their arrangements in the originating genome in both insects and shrimp is worthy of some contemplation. We propose that this phenomenon may have evolved because it prevents the easy re-establishment of complete, infectious viral genome sequences in the host genome by the process of recombination between EVE-cluster alleles. On the other hand, the ability of the EVE to produce RNA transcripts and give rise to vcDNA [11] would seem to open the possibility that rare recombination events might occur between infecting viruses and EVE products (RNA or DNA) and be an additional potential source viral variation. As far as we know, this possibility has not been explored in shrimp or insects.

Conclusions
Revelation of piRNA-gene-like clusters of EVE from specific viral types is worth further investigation. This may exemplify an advantageous evolutionary development that arose because collections of EVE from specific viral types into single linkage clusters would assure their transmission to offspring as potential "protective antiviral EVE packages" (PEVEP). It would also be an evolutionary advantage to have PEVEP for different viruses located on different chromosomes to assure that maximum variation in PEVEP combinations would occur simply by random assortment of chromosomes during the production of gametes. If such PEVEP operated like alleles, additional variation would be possible via crossover during meiosis. Without viral genome fragmentation and scrambling (including reading direction) during the formation of these piRNA gene-like clusters, it might be possible that crossover between two suitable APP would occasionally re-establish a full, infectious viral genome or even give rise to new virulent types. Fracture and jumbling may have evolved to circumvent this possibility. However, it does not eliminate the possibly of popup, false-positive PCR amplicons that might arise from crossover between EVE that re-establish the sequence of a PCR target for a standard viral detection method. Nor does it eliminate the possibility of RNA and vcDNA arising from EVE might on very rare occasions contribute to the evolution of viruses via recombination events.

Bioinformatics analysis of a draft P. monodon genome for IHHNV
The sequences of non-infectious IHHNV (GenBank record DQ228358) and infectious IHHNV (GenBank record AF218266) were used as the subject reference sequences for BlastN analysis (https:// blast. ncbi. nlm. nih. gov/ Blast. cgi) of a recently released whole draft genome (WDG) of the giant tiger shrimp P. monodon (GenBank accession no. JABERT000000000) [13]. Prediction of conserved DNA repeat sequences was determined using the Dfam database of repetitive DNA families (https:// www. dfam. org/ home) [21]. The MS PowerPoint program version 2019 was used to draw Fig. 1.

PCR detection and sequencing
The primer sequences used in this study were shown in Table 3. PCR and amplicons sequencing was used to confirm the IHHNV-EVE cluster in PC7. The PCR for IHHNV-EVE detection was performed in 12.5 μl consisting of 1X One-Taq PCR master mix (NEB, USA), 0.2 μl of each forward and reverse primer (PC7-R1F/PC7-R1R or PC7-R2F/ PC7-R2R) and 100 ng of shrimp DNA extract. The PCR cycle was performed using PCR Thermocycler (Bio-Rad, USA) in total 35 cycles which consisted of initial denaturation at 94 °C for 5 min then followed by 35 cycles of 95 °C for 30 sec, 55 °C for 30 sec, 72 °C for 30 sec. Final extension at 72 °C for 5 min was applied. The PCR protocol for the 309 method was similar to those of the IHHNV-EVE detection. The long-amp PCR using 98F/3762R primers was performed according to previous described [11]. Briefly, the PCR reaction was performed in 12.5 μl by using Long-Amp ™ Taq PCR mix (New England Biolab, USA). The PCR reaction consisted of Long-Amp Taq PCR reaction mix, 0.4 μM of forward and reverse primers (98F/3762R), 1 U Long-Amp ™ Taq polymerase, and 100 ng of shrimp DNA extract. The PCR cycle was started with initial denaturation at 94 °C for 30 s then followed by 35 cycles of 94 °C for 20 s, 55 °C for 30 s, 72 °C for 2.5 min and final extension at 72 °C for 10 min. The PCR amplicons were determined by 1.5% agarose gel electrophoresis and ethidium bromide staining before subjected to extract from agarose gel by gel/PCR extraction kit (GeneAid Biotech, Taiwan). The gel purified PCR amplicons were submitted for sequencing by their specific primers (ATGC, Thailand).